Sequence-to-Sequence Models for Punctuated Transcription Combing Lexical and Acoustic Features

نویسندگان

  • Ondřej Klejch
  • Peter Bell
  • Steve Renals
چکیده

In this paper we present an extension of our previously described neural machine translation based system for punctuated transcription. This extension allows the system to map from per frame acoustic features to word level representations by replacing the traditional encoder in the encoder-decoder architecture with a hierarchical encoder. Furthermore, we show that a system combining lexical and acoustic features significantly outperforms systems using only a single source of features on all measured punctuation marks. The combination of lexical and acoustic features achieves a significant improvement in F-Measure of 1.5 absolute over the purely lexical neural machine translation based system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Evaluation of Nucleic Acid Sequence Based Amplification (NASBA) and Reverse Transcription Polymerase Chain Reaction for Detection of Coxsackievirus B3 in Cell Culture and Animal Tissue Samples

Enteroviruses are the causative agents of a number of diseases in humans. Group B coxsackieviruses are believed to be the most common viral agents responsible for human heart disease. Genomic data of enteroviruses has allowed developing new molecular approaches such as Nucleic Acid Sequence Based Amplification (NASBA) for detection of such viruses. In this study, coxsackievirus B3 (CVB3) was de...

متن کامل

Automatic Labeling of Intonation Using Acoustic and Lexical Features

This paper proposes a framework of automatic intonation labeling which involves detection and classification of pitch accents and phrase boundaries. Four statistical models are designed to perform these tasks on the basis of a compact and simple representation consisting of features identified as the main acoustic correlates of accentual prominence and phrase boundaries or describing the acoust...

متن کامل

Impact of Layout Sequence of the Natural and Synthetic Adsorbents in Double-Layered Composites on Improving the Natural Fiber Acoustic Performance Using the Numerical Finite Element Method

Introduction: The acoustic performance of natural fiber adsorbents has been investigated in numerous studies. A part of these materials show a poor adsorption within the frequency range of less than 1000 Hz. In the present study, attempts were made to investigate the effect of layout sequence of double-layered composites consisting of natural and synthetic fibers on improving the acoustic adsor...

متن کامل

Cloning and Bioinformatics Analysis of the Gene Encoding Transcription Factor MYB44 of Sunflower (Helianthus annuus L.) under Salt Stress Conditions

Sunflower oilseeds (Helianthus annuus L.) are widely used around the world. Soil salinity negatively affects many morphological and physiological traits of sunflowers. Oil seed sunflower line tolerant to salinity stress (AS5305) was planted in normal and salinity stress conditions in a completely randomized design with two biological replications in a controlled environment. Salinity was applie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017